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Abstract 

Jaynes' transformation group principle can allow the computation of the prior density func- 
tions describing minimal knowledge for a velocity. The improper prior is uniform in the un- 
bounded velocity space of classical mechanics. In the relativistic case, however, it reads ~p(/3 x , j3y, /3z) = 
a x [1 — + Py + [3z)]~ 2 , but it can be rewritten as a uniform volumetric distribution, when 
the velocity space is given a non-trivial metric. 

1 Introduction 

Physicists seldom escape the need to express a state of incomplete knowledge of parameters describing 
a physical system. When certainty is not at hand, the appropriate language to describe information 
is probability. The Bayesian approach to probability has proved to both be free of the inconsistencies 
induced by the classical "frequentist" theory, and to provide an elegant solution of many controversial 
problems, particularly in statistical mechanics [[!]. However, the necessary assignment of a prior 
distribution describing minimal knowledge can be, when not neglected, a major difficulty. The 
search for the prior density function can lead to appropriate non-trivial results, even in the most 
elementary problems, as pointed out in the following, with the example of the velocity of a particle, 
in classical and special relativistic mechanics. 

2 Minimal information and measure 

I hereafter take the Bayesian viewpoint on probability theory jl| ||] , according to which the probability 
is a real-numbered measure of one's belief in the validity of a logical proposition A, given incomplete 
knowledge C. By incomplete, I mean that C does not allow one to establish the truth or falsehood 
of A with certainty. In this scheme, a probability is always conditional, in the sense that it can be 
assigned to a proposition A, only assuming some previous knowledge C. This is clearly recognized 
by denoting the probability of proposition A given information C as p(A\C). 

Let a; be a parameter taking its value in a given range, and characterizing the physical system under 
study ( from a chosen point of view ). The knowledge C obtained by some observation or experiment 
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is then usually represented by a probability density that will be noted, following Tarantola |3| by an 
ovcrlined symbol, e.g. : 

7, :•<• -Jj-n. (1) 

Proposition A can be a statement about the value of x (e.g.: A = (x 6]xi,X2[) ). The density f x 
allows then the computation of the probability p(A\C) by the usual relation : 

p(A\C)= [ X2 7 x (x)dx. (2) 

If a different parametrization y — T{x) is used to describe the system, then the information C induces 
a probability density / related to f x by 

7 x {x)=ly{v)\M- (3) 

where \Jt\ stands for the Jacobian of the change of parametrization T. Equation (j^) ensures that 
the probability of proposition A = (x s]xi,X2[) = [y G T(]xi,X2[)] given information C is attributed 
the same value whether the x or the y parametrization is used. This is hereafter referred to as the 
relation of conservation of probabilities. 

Now, prior to any observation, complete ignorance itself is also a state of knowledge, and it should 
be described by a density function (d.f.) too. Indeed, the raw definition of the parameter x and 
the properties it possesses are inevitably associated with some information / : the state of minimal 
information. The corresponding d.f. will be noted ~p x and is usually termed the least informative 
d.f., or the prior d.f. It must be emphasized that the form of JI X is generally not trivial, in particular, 
~p x is not necessarily a constant. Evidence of this is given by the application of Eq.(|§|) to two 
parametrizations x and y, where minimal knowledge is described by 

~Px( x ) =7*3/(2/) \M- ( 4 ) 

Hence the prior JJ cannot be constant in, for example, two parametrizations x and y related by a 
nonlinear transformation T. As a consequence, a general method of inference of the form of ~p must 
be sought. 

Jaynes 0] has proposed a "transformation group" method relying on the basic principle that "the 
state of minimal information is described by the same d.f. in two different parametrizations in 
which the problem is equivalently defined" , or, more widely stated by Tarantola and Valette [||, "the 
least informative d.f. is form-invariant under the transformations that leave invariant the equations of 
physics" . Therefore, in two parametrizations x and y related by such a transformation (i.e. y = T(x) 
), Jaynes' principle reads : 

~P X =~P y ( Le - V 2A Tiyiv) =~Px(v) )■ ( 5 ) 

Thus, the form of this function is constrained by the relation of conservation of probabilities (H), 
which imposes the constraint 

-p x (x)dx = JI x [T(x)}d[T(x)]. (6) 

This relation must be valid for all x and for every allowed transformation T. 

In a later version Q of his probabilistic approach to inverse problems, Tarantola suggests the use 
of volumetric probabilities /z instead of probability densities ~p. Let x = (x , x 2 , • • ■ , x") be a given 
parametrization of the physical system under study. Probability densities / have to be multiplied by 
the differential element dx = dx 1 dx 2 ■ • ■ dx™ of the coordinates to get an infinitesimal probability, 
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whereas volumetric probabilities / have to be multiplied by a volume element dV(x 1 , x 2 , ■ ■ ■ , x n ) to 
get the same equality 

f x {x 1 , x 2 , ■ ■ ■ , x n ) dx 1 dx 2 ---dx n = f(x\x 2 ,---,x n ) dVix 1 ,x 2 ,---,x n ). (7) 

Tarantola chooses then to describe ignorance by a constant volumetric probability fi = const. This 
arbitrary choice corresponds to the "natural" feeling that in the case of minimal knowledge, the 
probability is proportional to the volume element^]. From this choice, the search for the correct 
probability density ~p x turns into the search of the right volume element dV(x). Indeed, Eq. (^) 
becomes 

TL x {x 1 ,x 2 r --,x n )dx x dx 2 ---dx n =dV{x 1 ,x 2 ,---,x n ) x const. (8) 

From this point of view, Jaynes' principle can be stated as : "In two parametrizations where the 
laws of physics take the same form, the volume element has the same form" . In other words, the 
volume element is unchanged under a transformation T that "leave invariant the laws of physics" : 

dV(x 1 ,x 2 ,---,x n ) = dV[T(x 1 ,x 2 ,---,x n )}. (9) 

For a sufficiently well-defined parameter, these conditions should permit the identification of an 
"objective^" volume element. Furthermore, should the parameter space, where x is defined, be given 
a metric, its form can be constrained. Indeed, if the line element is 

dl 2 = 0y dx 1 dx 1 , (10) 

the volume element is given by the square root of the determinant of the metric gij multiplied by 
the differential of the parameters, that is : 



dV(x\x 2 ,---,x n ) = ^\g~~\dx 1 dx 2 ---dx n . (11) 

We will hereafter use this principle to infer the appropriate least informative d.f. for the velocity 
of a particle with non-zero rest-mass, using both classical and relativistic mechanics, and the corre- 
sponding norms. 



3 Velocity in classical mechanics 

Let K and K' be two inertial reference frames where observers O and O' respectively, are at rest, 
and such that for each of them, the other is moving along the x (or x') axis in his reference frame. 
Let P be a point-like material particle, moving freely along this common axis. The velocity of P can 
be indicated by its value v x for O and v' x for O' . Then, v x and v' x are related by the classical law of 
addition of velocities 

v' x =v x -V r , (12) 

where v r is the relative velocity of O' as seen by O. 

Let Jl x be the d.f. which represents minimal knowledge of this velocity for observer O, and ~p! x the 
corresponding least informative d.f. for the velocity of P as seen by O' . The principle of Galilean 

1 This is perhaps where the automatic use of a constant probability density to describe ignorance originates, by 
misusing the differential of the coordinates as the volume element. 

2 in the sense that it takes the same form whether one uses the x or the y = T(x) parametrization. 
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relativity states that neither of the two observers is in a special situation with respect to the laws 
of mechanics. Therefore, by Jaynes' principle, both observers O and O' must describe minimal 
knowledge of the particle's velocity by the same d.f., that is 

~Px = ~P'x • (13) 

Now, v x and v' x can be seen as two different parametrizations for the same physical system. The 
rule of conservation of probabilities (||) holds, and therefore the d.f. ~p x must obey the equivalent of 
Eq.(|), i.e. : 

~p x (v x ) dv x = ~p x (v x - v r ) dv x (14) 

This relation must be valid for all v r and all v x . This implies 

~fix( v x) = const. (15) 

Note that, as — oo < v x < +oo, this d.f. is an improper priorQ (this is a common feature of a least 
informative d.f.). 

In terms of volumetric probabilities, a comparison of the result ( |l5| ) and Eq. (||) gives immediately 
the correct volume element : 

dV(v x ) cc dv x . (16) 

The former case can be extended to 2 and 3 dimensional cases, in which the velocities measured by 
two equivalent observers are related by 

v' = R(v-v r ), (17) 

where v', v and v r are 2 or 3- vectors respectively standing for the velocity of : P seen by O' , P seen 
by O, and O' seen by O. R is, depending on the case, a 2-D or 3-D rotation matrix. 

Since O and O' are equivalent observers, they must describe minimal knowledge of the velocity by 
the same d.f. However, as previously, v and v' can be seen as two different parametrizations of the 
same system and hence, the rule of conservation of probabilities implies : 

M>) dv = Jlv [R(v - v r )] dv! . (18) 
The Jacobian of the transformation ([jj]) being equal to 1, this also leads to 

~p v (v) = const. (19) 
The deduction of the volume element is, as previously, straightforward. For the 2-D case, we get 
dV(v x , v y ) = const, x dv x dv y 

dV(v x , v z ) = const, x dv x dv z , (20) 
dV{v v , v z ) = const, x dv v dv z 

and for the 3-D case, we get, equivalently, 

dV(v x , v y , v z ) = const, x dv x dv y dv z . (21) 

This allows the computation of an "objective" metric in velocity space. For simplicity's sake, it is 
convenient to use the polar coordinate system (v, 0, tp) related to the cartesian coordinates (v x , v y ,v z ) 
by the relations : 

v x — v cos 6 

v y = v sin 9 sin ip (22) 
v z — v sin 9 cos ip 
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For reasons of isotropy, the length element must take the form : 

dl 2 = h{v) dv 2 + f 2 (v)[d6 2 + sin 2 9 dip 2 } . (23) 
Equation ( [l6| ) corresponds to a fixed direction of v, i.e. 9 = 0. This leads to dl 2 = g\ Y =const., say : 

9li = o 2 . (24) 
In polar coordinates, equations (|2^) and ( ^l|) read 
dV(v,6) = const, x vdvd9 



(25) 

dV(v, 9, ip) = const, x v 2 sin OdvdOdip . y ' 
This imposes fiiv) = const, x v 2 , that is 

922 =t V l ■ 2. (26) 

g 33 = b 2 v 2 sir 9 y ' 

and again, symmetry considerations impose a = b. Thus the invariant metric of the velocity space 
in classical mechanics takes the Euclidean form 



dl 2 =a 2 [dv 2 + v 2 (dd 2 + sin 2 9 dip 2 )} 
= a 2 (dv 2 + dv 2 +dv 2 z ). 



■2 ,,1,,2 > : . .2 , ( 27 ) 



This length element, together with the choice of a constant volumetric probability to describe minimal 
information, ensures that the prior for the velocity expressed in cartesian coordinates is constant, in 



one, two and three dimensional cases, as required by Eqs. (15) and (]l 



4 Velocity in special relativity 

We now express the velocities in terms of their ratio to the speed of light (i.e. : (3 = v/c). The 
difference from the former section is that, instead of the Galilean laws for the addition of velocities, 
the relativistic laws have now to be applied. 
In the one- dimensional case, the relativistic equivalent of Eq. ( |l2| ) reads 

/3'=f^, (28) 
1 — a p 

where (3' = v'/c and a = v r /c. It is convenient to use the parametrization defined by b = arctanh/3, 
a = arctanha and b' = arctanh/3', in which Eq.(E8|) takes exactly the same form as Eq.(fL2h, that is : 



b' = b-a. (29) 
Consequently, Jaynes' principle leads to : 

Jl b (b) = const. (30) 
Back in the (3 parametrization (using Eq. [| ), this becomes 

const. . . 

MP) = Y-fi- (31) 

Note that as —1 < (3 < 1, this d.f. is not normalizable, cf Eq. (fla). 
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For the two and three dimensional cases, the relativistic law of addition of velocities reads : 



I3' x = (f3 x -a)/(l-ap x ) 

(3' z =f3 z (l-a 2 )^ 2 /(l-af3 x ) 

It is more convenient to use the parametrization ((3, cos 9, ip) related to (/3 X , (3 y , (3 Z ) by 

(3 X = (3 cos 9 
(3 y — (3 sin 9 sin ip 
(3 Z = (3 sin 9 cos ip 



(32) 



(33) 



and similarly for the primed reference frame. These equations lead, after some calculation, to the 
following relations between (/3, cos 9, ip) and ((3', cos 9', tp') 




= [(1 - ap cos 9) 2 - (1 - I3 2 ){\ - a 2 )]^ 2 (l - a [3 



cost 



cos 9' — (f3 cos 6 — a) [(1 — a/3 cos t 



(l-^Xl-o 3 )]- 1 /" 



(34) 



One can check that the Jacobian of such a transformation allows the writing of a simple equality : 

(35) 



(1 _^, 2)2 d/3' d(cos 9') dip' = f d(3 d(cos 9) dp . 



The equivalent of Eq.(||), expressing both Jaynes' principle and the principle of special relativity 
reads 



M/3cos9 V (/3'. cos 9', tp') d/3' d(cos0') dip' = (J- 0coseip (0, cos6>, <p) d/3d(cos6») dip , 



(36) 



which means that two observers O and O' at rest in two inertial frames K and K' moving with a rela- 
tive velocity a, describe the minimal information about a particle's velocity (/?, cos 9, ip) by the same 
d.f. ~pp cos g v - Comparison of Eqs. ( |35| ) and (|36|) yields an obvious solution for ~pp cos g< p (f3, cos#, tp), 
which reads : 



M/3cos<v(/ 3 > COs6 '>^) 



ft 2 



{l-(3 2 f 



x a 



(37) 



where a is a constant. 

We can use isotropy arguments to assert that the corresponding metric of the parameter space 
must take the form 



dl 2 



9l (p)dl3 2 +g 2 (l3) 



d(cos(9) 5 
sin 2 (9 



sin 2 9 dip' 



(38) 



So that we have from (38) and d3l"|), for the fixed, known direction of the velocity 9 — 0, d9 = 



1/2 _ 



1 -(3 2 ' 

The more general three-dimensional case similarly leads from (|3^) and ([39]) to 

(3 2 



\tn(P)92{p) 



2ll/2 



= b x 



(1 -p 2 ) 2 



(39) 



(40) 



G 



We can thus infer the form of the two functions g\ and gi- Imposing that when (3 tends towards 
(i.e. the velocity of the particle is low compared to c), the metric takes the classical form (p7j), it is 
straightforward to obtain 

B 2 1 
■ d ff 2 + ,-, ( d6>2 + sin 2 fldtp 2 ) . 
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(i -FY 



(1-/32) 



(41) 



It is then purely a technical matter to get the corresponding form of the metric tensor for the 
Cartesian (/3 X , (3 yi (3 Z ) coordinates. The result is 



[fJr 



[^-{Pl + Pl + PW 



1 



-01-01 

PxPy 
PxPz 



1 



1 



PxPz 
PyPz 

-0.-0 



(42) 



PxPy 
-PI- PI 

PyPz 

This metric allows the direct computation of the prior for a velocity in one, two, and three dimensional 
cases. In particular, both Eqs. ( ^7| ) and ([42| ) yield : 

P/W. W*> P» P>) = [1 _ { p2 + a (3 2 +m 2 ( 43 ) 



5 Comments 

The volume element induced by Eq. (^lj) leads to an infinite volume for the whole velocity space 
— 1 < j3 < 1, < 6 < ir, < (p < lix. This is also related to the non-normalizability of the d.f. 
~Pp cos dip inferred in Eq. (37). As (3 tends towards 1, the "distance" between two "close" points 
l([P, P + d/3]) tends to infinity. This should have been expected, since the equivalent classical case is 
v — > +oo. The length element ( f4l| ) also implies that in velocity space, the invariant "distance" of a 
point to the sphere f3 = 1 is infinite, in any reference frame, therefore, the statement that a velocity v 
is "close" to c is meaningless. An other consequence is, that choosing 0.95 c as an "average" value for 
v, when one has the information 0.9 c < v < c, appears logically nonsense. In any case, in the energy 
parametrization E = Eq (1 — /3 2 )~ 1//2 , the corresponding knowledge would only give a lower value 
for the energy of the particle, and therefore, it would not come to one's mind to try to summarize 
this information in terms of one "average" value. 



The metric (41) and (42) is "objective" in the sense that it defines a measure element in the 
velocity space which is invariant under any Lorentz velocity-transformation, accordingly to the special 
principle of relativity. Both the metric ( f42|) and the densities (|3l|) or ( fi"3"l ) deduced from the special 
principle of relativity yield to the metric (P7|) and the densities (|15|) and ([t9]) respectively, when the 
classical approximation c — > +oo is made. 

It might appear strange that ~p xyz ^ JLJi y Ji z . Indeed, it is a counter-example of the intuitive 
general equality postulated by Tarantola Q. Actually, in the Galilean case, the equality holds, from 
( |l5| ) and (|l^). Yet, for the relativistic case, it is clear that it is not valid anymore. Indeed, the 
appearance of transverse terms in the metric ( [42] ) makes minimal information on the velocity in one 
fixed and known direction depend on the orthogonal velocity. This is a direct consequence of the 
relativity of time, and should not therefore be surprising. It illustrates however, how important it is 
to clearly define the problem before any prior is to be inferred by Jaynes' method. In other words, 
the parameters have to be precisely characterized, and the invariance transformation groups and 
subgroups found, previously to the application of the transformation-group method. 



To summarize, the prior density functions for a velocity have been found in 1, 2 and 3 dimensional 
cases, in Galilean and special relativity, using Jaynes' transformation group method. The inference 
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of the corresponding invariant "objective" measure in the velocity space illustrates the relations 
between measure, volume element and least-informative density function. Even in this very simple 
case, the result (i.e. Eqs. (Q) and ([l3]) ) is non-trivial. As the assignment of prior probabilities 
is a necessary first step before any consistent (i.e. Bayesian) probability-based inference method is 
put into practice, this shows that the precise definition of the parameters must be given one's full 
attention. In less obvious cases, this task is more dificult, but it can also lead to important conclu- 
sions. An application of this technique to derive the minimal information description of cosmological 
parameters is underway, and appears very promising. 
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